Extracting Latent Weblog Communities
نویسنده
چکیده
Abstract: I propose the concept of a latent weblog community (LBC), as a means to promote the autonomous organization of knowledge on the Internet. Such communities can be illustrated in terms of bipartite graphs based on weblog update information, and they can effectively function to create meeting spaces for bloggers who write about similar or closely related topics but do not know each other. To extract these communities from blogspace, I developed a partitioning algorithm known as the Weakest Pair (WP) algorithm, which separates the weakest pairs of bloggers and webpages, respectively, using co-citation information. As a result of numerical evaluation, the WP algorithm is more effective than the Shortest Path Betweenness (SPB) algorithm in terms of information loss and completeness of bipartite graphs. I will provide three examples of LBC extracted using the WP algorithm and report its secondary effects, i.e. personae detection, the detection of a set of weblogs owned by a single blogger.
منابع مشابه
Community Detection Based on Topic Distance in Social Tagging Networks
Research on the community detection in social tagging networks has attracted much attention in the last decade. Extracting the hidden topic information from tags provides a new way of thinking for community detection in social tagging networks. In this paper, a topic tagging network by extracting several topics from the tags through using the Latent Dirichlet Allocation (LDA) model is built fir...
متن کاملConceptual Evaluation of Weblog as a Computer Mediated Communication Application
Weblog is fast becoming an unprecedented mode of communication and self-expression for both individuals and communities. It offers users an avenue to express their opinions regarding anything that come to their minds, from current news items to documenting their lives and thoughts. Bewildered by its rapid growth and widespread popularization, researchers have sought to understand the “exquisite...
متن کاملCo-clustering for Weblogs in Semantic Space
Web clustering is an approach for aggregating web objects into various groups according to underlying relationships among them. Finding co-clusters of web objects in semantic space is an interesting topic in the context of web usage mining, which is able to capture the underlying user navigational interest and content preference simultaneously. In this paper we will present a novel web co-clust...
متن کاملDiscovering Weblog Communities: A content- and topology-based approach
Weblogs have become a leading form of self-publication on the web. Personal weblogs are often considered to represent a person, and the links between webogs can naturally be given a social interaction. Against this background, finding a community around a given weblog—i.e., identifying a set of weblogs that forms a natural group together with the starting point, because of content or social rea...
متن کاملComparative Analysis of Apriori Algorithm and Frequent Pattern Algorithm for Frequent Pattern Mining in Web Log Data
The growth and popularity of the internet has increased the growth of web marketing. Extracting usage patterns deals with the weblog records to discover user access patterns of Web Pages. Weblog databases provide rich information about what kind of users will access which kind of web pages. Analyzing and exploring regularities in Weblog records can identify potential customers for electronic co...
متن کامل